Information on data:

This data covers tornado damage to buildings in New Orleans during December 2022. It was obtained from Verisk Analytics and was derived using computer vision and machine learning on post-catastrophe aerial imagery. There are approximately 42,000 buildings in this dataset.


Clean data:

I converted roof_solar into a logical (TRUE/FALSE) variable by mapping “SOLAR PANEL” to TRUE and “NO SOLAR PANEL” to FALSE. I also set to NA the roof shapes that the computer wasn’t very sure about (up to a 20% chance of being incorrect). Some cells in damage_level contained an empty string, so I converted those to NA as well. I then separated longitude and latitude into their own columns so that they could be easily read into leaflet.
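As a rough illustration, the cleaning steps described above might look like the following in base R. This is only a sketch on a made-up data frame: the columns `roof_shape`, `roof_shape_conf`, and `location`, the "long,lat" text layout, and the 0.80 confidence cutoff are all assumptions for illustration, since only `roof_solar` and `damage_level` are named in the text.

```r
# Toy stand-in for the raw data; every value here is made up.
raw <- data.frame(
  roof_solar      = c("SOLAR PANEL", "NO SOLAR PANEL", "NO SOLAR PANEL"),
  roof_shape      = c("gable", "hip", "flat"),
  roof_shape_conf = c(0.95, 0.70, 0.85),   # hypothetical confidence column
  damage_level    = c("major", "", "minor"),
  location        = c("-90.07,29.95", "-90.05,29.96", "-90.10,29.93"),  # hypothetical "long,lat" text
  stringsAsFactors = FALSE
)

clean <- raw

# "SOLAR PANEL" -> TRUE, "NO SOLAR PANEL" -> FALSE
clean$roof_solar <- clean$roof_solar == "SOLAR PANEL"

# Assumed rule: roof shapes with confidence below 0.80 are treated as missing
clean$roof_shape[clean$roof_shape_conf < 0.80] <- NA

# Empty strings in damage_level become NA
clean$damage_level[clean$damage_level %in% ""] <- NA

# Split the combined coordinate text into numeric long and lat columns
parts <- strsplit(clean$location, ",", fixed = TRUE)
clean$long <- as.numeric(vapply(parts, `[`, character(1), 1))
clean$lat  <- as.numeric(vapply(parts, `[`, character(1), 2))

str(clean)
```

Keeping `raw` untouched and working on a copy makes it easy to count how many values each step turned into NA.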


Define damage categories:

The damage categories are based on cutoffs in catastrophe score, chosen from the summary of the dataset with catastrophe scores of 0 excluded.

library(dplyr)

mostdamage <- df %>% filter(catastrophescore >= 50)
nodamage <- df %>% filter(catastrophescore == 0)
# Scores of exactly 100; these rows are also included in mostdamage
decimated <- df %>% filter(catastrophescore == 100)

# Earlier cutoffs, kept for reference
# middamage <- df %>% filter(catastrophescore < 50 & catastrophescore > 25)
# leastdamage <- df %>% filter(catastrophescore < 25 & catastrophescore != 0)

# Might work better based on quartiles of attributes selected
middamage <- df %>% filter(catastrophescore < 50 & catastrophescore >= 15)
leastdamage <- df %>% filter(catastrophescore < 15 & catastrophescore >= 2)
minimaldamage <- df %>% filter(catastrophescore == 1)
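The same cutoffs can also be stored as a single category column instead of separate data frames. The sketch below uses base R's `cut()` on made-up scores and assumes that `catastrophescore` only takes whole-number values from 0 to 100, which the source does not state. The column name `damage_cat` and the labels are hypothetical.

```r
# Toy scores; assumed to be whole numbers from 0 to 100.
df <- data.frame(catastrophescore = c(0, 0, 1, 2, 14, 15, 49, 50, 100))

# Summary of the nonzero scores, the basis for choosing cutoffs
nonzero <- df$catastrophescore[df$catastrophescore != 0]
summary(nonzero)

# One label per building, matching the filters above for whole-number scores:
# 0 = none, 1 = minimal, 2-14 = least, 15-49 = mid, 50-100 = most
df$damage_cat <- cut(
  df$catastrophescore,
  breaks = c(-Inf, 0, 1, 14, 49, 100),
  labels = c("none", "minimal", "least", "mid", "most"),
  right  = TRUE   # each interval includes its upper bound
)

table(df$damage_cat)
```

A single factor column like this can be passed directly to a colour palette or used as a grouping variable, while the `decimated` subset remains a simple filter on scores of 100.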

Damage maps:

To do: see if there are better markers to use for these maps, and see if I can add labels with the long, lat, roof shape, and catastrophescore.


These are the buildings that sustained damage.

Red indicates the most damaged buildings (catastrophe score >= 50), orange indicates moderately damaged buildings (25 < catastrophe score < 50), and blue indicates the least damaged buildings (catastrophe score <= 25, excluding scores of 0). The majority of the buildings (3852) exhibited a catastrophe score of 0.
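One possible way to build such a map is sketched below, with the colour rule taken from the description above and a popup for each marker. The data frame `bldg`, its values, and the helper `damage_colour()` are made up for illustration. The column names follow the ones used elsewhere in this report, and the map itself is only drawn if the leaflet package is installed.

```r
# Toy data; coordinates and scores are made up.
bldg <- data.frame(
  long = c(-90.07, -90.05, -90.10, -90.02),
  lat  = c(29.95, 29.96, 29.93, 29.97),
  roofshape = c("gable", "hip", "flat", "gable"),
  catastrophescore = c(0, 10, 30, 75),
  stringsAsFactors = FALSE
)

# Colour rule from the description above; undamaged buildings get NA
damage_colour <- function(score) {
  ifelse(score >= 50, "red",
  ifelse(score > 25,  "orange",
  ifelse(score > 0,   "blue", NA_character_)))
}

bldg$colour <- damage_colour(bldg$catastrophescore)
damaged <- bldg[!is.na(bldg$colour), ]   # keep only buildings with damage

if (requireNamespace("leaflet", quietly = TRUE)) {
  m <- leaflet::leaflet(damaged)
  m <- leaflet::addTiles(m)
  m <- leaflet::addCircleMarkers(
    m, lng = ~long, lat = ~lat, color = ~colour,
    radius = 4, stroke = FALSE, fillOpacity = 0.8,
    # Popup text shown when a marker is clicked
    popup = ~paste0("Long: ", long, "<br>Lat: ", lat,
                    "<br>Roof shape: ", roofshape,
                    "<br>Catastrophe score: ", catastrophescore)
  )
  m <- leaflet::addLegend(
    m, position = "bottomright",
    colors = c("red", "orange", "blue"),
    labels = c("score >= 50", "25 < score < 50", "0 < score <= 25")
  )
  m   # display the map
}
```

Circle markers tend to stay readable when many buildings overlap, and the popup carries the long, lat, roof shape, and catastrophe score for each building.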

Map of the buildings that experienced damage:

Map of the buildings that experienced the most damage:

Map of the buildings that experienced mid damage:

Map of the buildings that experienced the least damage:

Map of the buildings that experienced no damage:

Map of the buildings that experienced no damage and the most damage:


Models:

## catastrophescore ~ long + lat + roofshape + rooftree + roofmateri

## catastrophescore ~ long + lat + trampoline + deck + pool + enclosure + 
##     divingboar + waterslide + playground + sportcourt + primarystr + 
##     roofsolar + rooftree + roofmateri * roofshape

## catastrophescore ~ long + lat + enclosure + roofmateri + roofsolar + 
##     rooftree + roofshape
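The formulas above do not say which kind of model they were fitted with. As a sketch only, assuming ordinary linear regression with `lm()`, the first and third formulas could be fitted and compared as follows. The data frame `toy` is simulated, and its variable types (factors for roof shape and material, logicals for the rest) are guesses rather than the actual structure of the dataset.

```r
set.seed(1)
n <- 200

# Simulated stand-in data; types and values are assumptions.
toy <- data.frame(
  long       = runif(n, -90.15, -89.95),
  lat        = runif(n, 29.90, 30.05),
  roofshape  = factor(sample(c("gable", "hip", "flat"), n, replace = TRUE)),
  rooftree   = sample(c(TRUE, FALSE), n, replace = TRUE),
  roofmateri = factor(sample(c("shingle", "metal", "tile"), n, replace = TRUE)),
  roofsolar  = sample(c(TRUE, FALSE), n, replace = TRUE),
  enclosure  = sample(c(TRUE, FALSE), n, replace = TRUE)
)
# Random scores clamped to the 0-100 range
toy$catastrophescore <- pmin(100, pmax(0, round(rnorm(n, 20, 15))))

# First and third formulas from the output above
f1 <- catastrophescore ~ long + lat + roofshape + rooftree + roofmateri
f3 <- catastrophescore ~ long + lat + enclosure + roofmateri + roofsolar +
  rooftree + roofshape

m1 <- lm(f1, data = toy)
m3 <- lm(f3, data = toy)

summary(m1)$coefficients   # estimates, standard errors, t values, p values
AIC(m1, m3)                # lower AIC suggests a better trade-off of fit and size
```

Because the toy response is pure noise, the coefficients here mean nothing. The point is only the workflow of fitting each formula to the same data and comparing the fits.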

Check models:

Model 1

Model 2

Model 3
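If the models are linear regressions, which is an assumption here, a standard check is the set of residual diagnostic plots that base R produces for an `lm` fit. The sketch below runs them on simulated data. The object names `toy` and `fit` are hypothetical, and the fitted formula is cut down to two predictors for brevity.

```r
set.seed(2)
n <- 150

# Simulated stand-in data; values are made up.
toy <- data.frame(
  long = runif(n, -90.15, -89.95),
  lat  = runif(n, 29.90, 30.05)
)
toy$catastrophescore <- pmin(100, pmax(0, round(rnorm(n, 20, 15))))

fit <- lm(catastrophescore ~ long + lat, data = toy)

# Four standard panels: residuals vs fitted, normal Q-Q,
# scale-location, and residuals vs leverage
op <- par(mfrow = c(2, 2))
plot(fit)
par(op)

# Numeric companions to the plots
res <- residuals(fit)
summary(res)
range(fitted(fit))   # a linear model is not constrained to the 0-100 score range
```

With a score that is bounded and has many zeros, curvature in the residuals-vs-fitted panel or fitted values outside 0 to 100 would be signs that a plain linear model is a poor fit.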

Predictions:

I’m planning to add predictions and additional plots for the completed final project.

Interpretations of predictions and models used:

This will also be updated.